Positive Grammar Checking: A Finite State Approach
نویسندگان
چکیده
This paper reports on the development of a finite state system for finding grammar errors without actually specifying the error. A corpus of Swedish text written by children served as the data. Errors are more frequent and the distribution of the error types is different than for adult writers. Our approach (following Karttunen et al. [9]) for finding errors involves developing automata that represent two “positive” grammars with varying degree of detail and then subtracting the detailed one from the general one. The difference between the automata corresponds to a grammar for errors. We use a robust parsing strategy which first identifies the lexical head of a phrase together with the lexical string which precedes it beginning at the left margin of the phrase. The technique shows good coverage results for agreement and verb selection phenomena. In future, we aim to include also the detection of agreement between subject and predicative complements, word order phenomena and missing sentence boundaries.
منابع مشابه
A Hybrid Meta-heuristic Approach to Cope with State Space Explosion in Model Checking Technique for Deadlock Freeness
Model checking is an automatic technique for software verification through which all reachable states are generated from an initial state to finding errors and desirable patterns. In the model checking approach, the behavior and structure of system should be modeled. Graph transformation system is a graphical formal modeling language to specify and model the system. However, modeling of large s...
متن کاملLearning and Model-Checking Networks of I/O Automata
We introduce a new statistical relational learning (SRL) approach in which models for structured data, especially network data, are constructed as networks of communicating finite probabilistic automata. Leveraging existing automata learning methods from the area of grammatical inference, we can learn generic models for network entities in the form of automata templates. As is characteristic fo...
متن کاملWeakening Horn Knowledge Bases in Regular Description Logics to Have PTIME Data Complexity
In description logics, an RBox is a finite set of role axioms of the form Rs1 ◦ . . .◦Rsk v Rt, where Rs1 , . . . , Rsk , Rt are role names. A regular RBox is an RBox whose set of corresponding grammar rules t→ s1 . . . sk forms a grammar such that, for every symbol s of the grammar, the set of words derivable from s using the grammar is a regular language. We assume that a regular RBox is spec...
متن کاملReachability checking in complex and concurrent software systems using intelligent search methods
Software system verification is an efficient technique for ensuring the correctness of a software product, especially in safety-critical systems in which a small bug may have disastrous consequences. The goal of software verification is to ensure that the product fulfills the requirements. Studies show that the cost of finding and fixing errors in design time is less than finding and fixing the...
متن کاملRegular Model Checking Using Inference of Regular Languages
Regular model checking is a method for verifying infinite-state systems based on coding their configurations as words over a finite alphabet, sets of configurations as finite automata, and transitions as finite transducers. We introduce a new general approach to regular model checking based on inference of regular languages. The method builds upon the observation that for infinite-state systems...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003